arXiv:1507.02618v2 [cs.DS] 8 Sep 2016 


Sublinear Distance Labeling 


Stephen Alstrup*^, Spren Dahlgaard^^, Mathias Baek Tejs Knudsen*^^, and Ely Porat^ 

^University of Copenhagen, {s . alstrup, soerend, knudsendOdi .ku.dk 
^Bar-Ilan University, poratelyScs. biu .ac.il 

September 9, 2016 


Abstract 

A distance labeling scheme labels the n nodes of a graph with binary strings such that, 
given the labels of any two nodes, one can determine the distance in the graph between 
the two nodes by looking only at the labels. A il-preserving distance labeling scheme only 
returns precise distances between pairs of nodes that are at distance at least D from each 
other. In this paper we consider distance labeling schemes for the classical case of unweighted 
graphs with both directed and undirected edges. 

We present a 0{^\o^ D) bit D-preserving distance labeling scheme, improving the 
previous bound by Bollobas et. al. [SIAM J. Discrete Math. 2005]. We also give an almost 
matching lower bound of P(^). With our Z3-preserving distance labeling scheme as a building 
block, we additionally achieve the following results: 

1 . We present the first distance labeling scheme of size o{n) for sparse graphs (and hence 
bounded degree graphs). This addresses an open problem by Gavoille et. al. [J. Algo. 2004], 
hereby separating the complexity from distance labeling in general graphs which require 
D(n) bits. Moon [Proc. of Glasgow Math. Association 1965]. 

2 . For approximate r-additive labeling schemes, that return distances within an additive 
error of r we show a scheme of size O ■ P°biogMog ^ j. ^ 2. This improves on the 

current best bound of O (”) by Alstrup et. al. [SODA 2016] for sub-polynomial r, and is a 
generalization of a result by Gawrychowski et al. [arXiv preprint 2015] who showed this for 
r = 2. 


* Research partly supported by the FNU project AlgoDisc - Discrete Mathematics, Algorithms, and Data 
Structures. 

^Research partly supported by Mikkel Thorup’s Advanced Grant from the Danish Council for Independent 
Research under the Sapere Aude research career programme. 

^This result for sparse graphs was made available online in a preliminary version of this paper [?]• The label 
size was subsequently slightly improved by an O(loglogn) factor by Gawrychowski et al. |26| . 
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1 Introduction 


The concept of informative labeling schemes dates back to Brener and Folkman naiiii and 
was formally introduced by Kannan et al. |3I1|35]. A labeling scheme is a way to represent a 
graph in a distributed setting by assigning a bit string (called a label) to each node of the graph. 
In a distance labeling scheme we assign labels to a graph G from a family Q (i.e. all forests, 
bounded-degree graphs, or planar graphs with n nodes) such that, given only the labels of a pair 
of nodes, we can compute the distance between them without the need for a centralized data 
structure. When designing a labeling scheme the main goal is to minimize the maximum label 
size over all nodes of all graphs G in the family Q. We call this the size of the labeling scheme. 
As a secondary goal some papers consider the encoding and decoding time of the labeling scheme 
in various computational models. In this paper we study the classical case of unweighted graphs. 

Exact distances The problem of exact distance labeling in general graphs is a classic problem 
that was studied thoroughly in the 1970/80’s. Graham and Poliak m and Winkler m showed 
that labels of size [(n — 1) • log 2 3] suffice in this case. Combining |31| and |34) gives a lower 
bound of [n/2] bits (see also [25]). Recently, Alstrup et al. [^ improved the label size to 
-I- 0(log^ n) bits. 

Distance labeling schemes have also been investigated for various families of graphs, providing 
both upper and lower bounds. For trees, Peleg |36) showed that labels of size O(log^n) suffice 
with a matching lower bound by Gavoille et. al |25| . Gavoille et. al |25] also showed a D(n^/^) 
lower bound for planar graphs and bound for bounded degree (and thus sparse) graphs 

and for weighted graphs this was recently improved by Abboud and Dahlgaard [T] to 0,{^/nlogn) 
even for bounded planar graphs. Gavoille et al. |25) provided an 0{^/nlogn) labeling scheme 
for planar graphs (even weighted), however nothing better than the 0(n) scheme for general 
graphs is known for bounded-degree graphs. It has remained a major open problem in the field of 
labeling schemes whether a scheme of size 0(\/n) or even o(n) exists for bounded-degree graphs 
as stated in e.g. |25] . 

Other families of graphs studied include distance-hereditary |23) . bounded clique-width 
some non-positively curved plane |16] . as well as interval |24] and permutation graphs m 

Approximate distances For some applications, the Vt{poly{n)) requirement on the label size 
for several graph classes is prohibitive. Therefore a large body of work is dedicated to labeling 
schemes for approximating distances in various families of graphs lalllslEolESlEHlESlEalEelETl 
l38l l39] . Such labeling schemes often provide efficient implementations of other data structures 
like distance oracles |38j and dynamic graph algorithms j^. 

In |36| a labeling scheme of size 0(log^ n - k- was presented for approximating distances 
up to a factoi|^of In |38j a scheme of poly-logarithmic size was given for planar graphs 

when distances need only be reported within a factor of (1 -|- e). Labeling schemes of additive 
error have also been investigated. For general graphs Alstrup et. al jS] gave a scheme of size 
0{n/r) for r-additive distance labeling with r ^ 2 and a lower bound of was given by 

Gavoille et al. |22j . For r = 1 a lower bound of D(n) can be established by observing that such 
a scheme can answer adjacency queries in bipartite graphs, which require n/4 bits to label for 
adjacency. 

Distance preserving An alternative to approximating all distances is to only report exact 
distances above some certain threshold D. A labeling scheme, which reports exact distances for 

^This does not break the Girth Conjecture, as the labeling scheme may under-estimate the distance as well. 
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nodes u, v where dist{u, v) ^ D is called a D-preserving distance labeling sc/iem^ Bollobas et 
al. [I2] introduced this notion and gave a labeling scheme of size 0{^ log^ n) for both directed 
and undirected graphs. They also provided an logD) lower bound for directed graphs. 

1.1 Related work 

A problem closely related to distance labeling is adjacency labeling. For some classes such as 
general graphs the best-known lower bounds for distance is actually that of adjacency. Adjacency 
labeling has been studied for various classes of graphs. In the label size for adjacency in 
general undirected graphs was improved from n/2 + O(logn) [31[ IM] to optimal size n/2 + 0(1), 
and in [6j adjacency labeling for trees was improved from log 2 n + 0(log* n) [10] to optimal size 
\og2n + 0 ( 1 ). 

Distance labeling schemes and related 2-hop labeling are used in SIGMOD and is central 
for some real-world applications miiEiEQ]. Approximate distance labeling schemes have found 
applications in several helds such as reachability and distance oracles |38| and communication 
networks [36]. An overview of distance labeling schemes can be found in |8|. 

1.2 Our results 

We address open problems of |8| 121125] improving the label sizes for exact distances in sparse 
graphs, r-additive distance in general graphs, and D-preserving distance labeling. We do this by 
showing a strong relationship between O-preserving distance labeling and several other labeling 
problems using D-preserving distance labels as a black box. Thus, by improving the result of 
|12j we are able to obtain the first sublinear labeling schemes for several problems studied at 
SODA over the past decades. Our results hold for both directed and undirected graphs and are 
summarized below. 

Sparse graphs: We present the first sublinear distance labeling scheme for sparse graphs 
giving the following theorem: 

Theorem 1. Let Sn denote the family of unweighted graphs on n nodes with at most 
edges. Then there exists a distance labeling scheme for Sn with maximum label size o(n). 

As noted, prior to this work the best-known bound for this family was the 0(n) scheme of 
|8] for general graphs. Thus, Theorem separates the family of sparse graphs from the family 
of general graphs requiring fl(n) label size. Our result uses a black-box reduction from sparse 
graphs to the D-preserving distance scheme of Theorem below. The result of Theorem was 
made available online in a preliminary version of this paper [^ and was subsequently slightly 
improved by Gawrychowski et al. |26j by noting, that one of the steps in the construction of our 
D-preserving distance scheme can be skipped when only considering sparse graph^ 

Approximate labeling schemes: For r-additive distance labeling Gawrychowski et al. |26j 
showed that a sublinear labeling scheme for sparse graphs implies a sublinear labeling scheme for 
r = 2 in general graphs. We generalize this result to r ^ 2 by a reduction to the D-preserving 
scheme. We note that a reduction to sparse graphs does not suffice in this case, and the scheme 
of |26| thus only works for r = 2. More precisely, we show the following: 

^In this paper we adopt the convention that the labeling scheme returns an upper-bound if dist{u, v) < D. 
The scheme presented in this paper has labels of length O ( ”^ , where A = . In [2S] they 

improve the exponent of the polylog A term from 2 to 1. 
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Theorem 2. For any r ^ 2, there exists an approximate r-additive labeling schemes for the 
family On of unweighted graphs on n nodes with maximum label size 


Qi 'O' polylog(r log n) 
logn 


Theorem ^improves on the previous best bound of O (”) by [ 8 ] whenever r = , e.g. 


when r = poly logn. 


ZD-preserving labeling schemes: For D-preserving labeling schemes we show that: 

Theorem 3. For any integer D e [1, n\, there exists a D-preserving distance labeling scheme for 
the family On of unweighted graphs on n nodes with maximum label size 

o(^^max{log^T),l}^ . 

Theorem [^improves the result of |12| by a factor of 0(log^ n/log^ D) giving the hrst sublinear 
size labels for this problem for any D = w(l). This sublinearity is the main ingredient in showing 
the results of Theorems and Our scheme uses sampling similar to that of |12| . By sampling 
fewer nodes we show that not “too many” nodes end up being problematic and handle these 
separatel }0 

Finally, we show the following almost matching lower bound for undirected graphs extending 
the construction of m for directed graphs. 

Theorem 4. A D-preserving distance labeling scheme for the family On of unweighted and 
undirected graphs on n nodes require label size when D is an integer in [l,n — 1]. 


2 Preliminaries 

Throughout the paper we adopt the convention that Igx = max(log 2 x, 1) and logx = Inx. 
When X ^ 0 we dehne Igx = 1. In this paper we assume the word-RAM model, with word size 
w = ©(logn). If s is a bitstring we denote its length by |s| and will also use s to denote the 
integer value of s when this is clear from context. We use s o s' to denote concatenation of bit 
strings. Finally, we use the Elias 7 code |19| to encode a bitstring s of unknown length using 2|s| 
bits such that we may concatenate several such bitstrings and decode them again. 

Labeling schemes A distance labeling scheme for a family of graphs 0 consists of an encoder 
e and a decoder d. Given a graph G e 0 the encoder computes a label assignment cq '■ V(G) — > 
{0,1}*, which assigns a label to each node of G. The decoder is a function such that given any 
graph G e 0 and any pair of nodes u,v e V {G) we have d{eG{u),eG{v)) = distG{u, v). Note that 
the decoder is oblivious to the actual graph G and is only given the two labels eG{u) and eG{v). 

The size of a labeling scheme is defined as the maximum label size |eG(ri)| over all graphs 
G e 0 and all nodes u e V{G). If for all graphs G e 0 the mapping cg is injective we say that 
the labeling scheme assigns unique labels (note that two different graphs G,G' e 0 may share a 
label). 

If the encoder and graph is clear from the context, we will sometimes denote the label of a 
node u by £{u) = CGiu). 

Various computability requirements are sometimes imposed on labeling schemes 

®We note that after making this result available online in a preliminary version [7], the bound of Theorem]^ 
was slightly improved by Gawrychowski et al. |26| to 0{j^ logD). 
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3 D-preserving distance labeling schemes 

In this section we will prove Theorem Observe hrst that for D = 1 Theorem is exactly the 
classic problem of distance labeling and we may use the result of | 8 ]. We will therefore assume 
that D ^ 2 for the remainder of this paper. Let us first formalize the definition of a D-preserving 
distance labeling scheme. 

Definition 1 . Let D be a positive integer let ^ be a family of graphs. For each graph G e Q 
let cg ■ y{G) {0,1}* be a mapping of nodes to labels. Let d : {0,1}* x {0,1}* —> Z be a 
decoder. If e and d satisfy the following two properties, we say that the pair (e, d) is a D-distance 
preserving labeling seheme for the graph family Q. 

1. d{eG{u), eciv)) ^ distG{u,v) for all u,ve G for any GeQ. 

2. d^eciu), eciv)) = distciu^v) for all u,v e G with distciu^v) ^ D for any GeQ. 

The idea of the labeling scheme presented in this section is to hrst make a labeling scheme 
for distances in the range [D, 2D] and use this scheme for increasingly bigger distances until all 
distances of at least D are covered. Loosely speaking, the scheme is obtained by sampling a set 
of nodes R, such that most shortest paths of length at least D contain a node from R. Then all 
nodes are partitioned into sick and healthy nodes adding the sick nodes to the set R. All nodes 
then store their distance to each node of R and healthy nodes will store the distance to all nodes, 
for which the shortest path is not covered by some node in R. 

з. 1 A sample-based approach 

As a warm-up, we first present the log^ n) scheme of Bollobas et al. in |12) with a slight 
modihcation. 

Given a graph G = {V,E) e Q we pick a random multiset R<^V consisting of [c • ^ log n] 
nodes for a constant c to be decided. Each element of R is picked uniformly and independently 
at random from V (i.e. the same node might be picked several times)[^ We order R arbitrarily 
as (u)i,..., and assign the label of a node ue V as 

£{u) = distciu, rci) o distciu, W 2 ) o ... o distciu, rc|R|) 

Lemma 1 . Let u and v be two nodes of some graph GeQ. Set 

d = ramdistciu^w) + distciv^w) . (1) 

weR 

Then d ^ distciu^v) and d = distG{u,v) if R contains a node from a shortest path between u 
and V. 

Proof. Let z e R he the node corresponding to the minimum value of (??). We then have 
d = distG{u,z) + distG{z,v). By the triangle inequality this implies d ^ dist{u,v). 

Now let p be some shortest path between u and n in G and assume that z e p. Then 
distciu, v) = distciu, z)+distG{z, v), implying that d ^ distciu, v), and thus d = dist{u, v). □ 

By Lemma it only remains to show that the set R is likely to contain a node on a shortest 
path between any pair of nodes u,v eV with distciu, v) ^ D. 

Lemma 2. Let R be defined as above. Then the probability that there exists a pair of nodes 

и, v eV such that distciu, v) ^ D and no node on the shortest path between u and v is sampled 
is at most n^~^. 

®In 1121 they instead picked R by including each node of G with probability 


5 




Proof. Consider a pair of nodes u,v e V with distciu, v) ^ D. Let p be a shortest path between 
u and V, then \p\ ^ D. Each element of R has probability at least D/n of belonging to p 
(independently), so the probability that no element of R belonging to p is at most 



n 


\R\ 



^ exp(—clogn) = n 


( 2 ) 


Since there are at most such pairs, by a union bound the probability that there exists a pair 
u, V with distciu, v) ^ D, such that no element on a shortest path between u and v is sampled 
in R is thus at most nP ■ n~^ = □ 


By setting c > 2 we can ensure that the expected number of times we have to re-sample 
the set R until the condition of Lemma is satisfied is 0(1). The labels can be assigned using 
0(|i?| logn) = O(^log^n) bits as each distance can be stored using O(logn) bits. 


3.2 A scheme for medium distances 

We now present a scheme, which preserves distances in the range [0,20] using 0(]jlog^0) 
bits. More formally, we present a labeling scheme such that given a family of unweighted graphs 
Q the encoder, e, and the decoder, d, satishes the following constraints for any G e Q: 

1. d{eG{u), eciv)) ^ distG{u,v) for any u,v e G. 

2 . d{eG{u), eGiv)) = distG{u,v) for any u,v e G with distG{u,v) e [0,20]. 

Let such a labeling scheme be called a [D,2D^-preserving distance labeling scheme. 

The labeling scheme is based on a sampling procedure similar to that presented in Section |3.1[ 
but improves the label size by introducing the notion of sick and healthy nodes. Below we 
described only the labeling scheme for undirected graphs. We note that this can be turned into a 
labeling scheme for directed graphs by at most doubling the label size. In our undirected labeling 
scheme we store distances to several nodes in the graph, and for a directed scheme one simply 
needs to store both distances to and from these nodes. This will be evident from the description 
below. 

Let G = iy, E) e Q. We sample a multiset R of size 2 • ^ logD. Similar to Section 
element of R is picked uniformly at random from V. 

Definition 2. Let R be as defined above and fix some node u. We say that a node v is uncovered 
for u if distG{u, v) ^ D and no node in R is contained in a shortest path between u and v. A 
node u with more than ^ uncovered nodes is called sick and all other nodes are called healthy. 

Let S denote the set of sick nodes and let uc(u) denote the set of uncovered nodes for u. The 
main outline of the scheme is as follows: 

1. Each node u stores the distance from itself to each node of i? u S. 


3.1 each 


2. If u is healthy, u stores the distance from itself to every v e uc(u) for which distG{u, v) e 
[D,2D]. 

We start by showing that the set of sick nodes has size 0{n/D) with probability at least 1/2. 
This is captured by the following lemma. 


Lemma 3. Let R be defined as above and let S be the set of sick nodes. Then 


Pr 


n 


|5| ^ 2-J ^ 1/2 . 
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Proof. Fix some node u e V and let n e i/ be a node such that distciu, v) ^ D. Using the same 
argument as in (??) of Lemma we see that the probability that v is uncovered for u is at most 
D~^. Therefore E[|uc(w)|] ^ By Markov’s inequality we have 


Pr[rt is sick] = Pr 


n 


|uc(u)| > D - 


1 

^ D ’ 


and thus E[|S'|] ^ n/D. We again use Markov’s inequality to conclude that 


Pr 


\Sl>2f 


^ 1/2 . 


□ 


The goal is now to store the distances to the nodes of i? u S as well as uc(u) using few bits. 
First consider the distances to the nodes of i? u S'. Observe that since we only wish to recover 
distances in the interval [D, 2D] we only need to store distances to the nodes of i? u S which are 
at most 2D away. Let u be any node in G. We will store the distances from u to the relevant 
nodes of i? u S as follows: We first hx a canonical ordering of the nodes in Ru S, which is the 
same for all nodes u e G. For each node of i? u S in order we now store either a 0-bit if its 
distance to u is greater than 2D. Otherwise we store the distance using at most 0(logD) bits. 

We may now assign the label i{u) of a node u to be id{u) concatenated with the bitstring 
resulting from the above procedure for i? u S'. If tt is healthy we concatenate an identiher for the 
set uc{u) of uncovered nodes restricted to nodes within distance [H, 2D] along with the distance 
to each of these nodes. The decoder works by simply checking if one nodes stores the others 
distance or by taking the minimum of going via any node in i? u S. 


Label size In order to bound the size of the label we first observe that R u S has size at 
most logH) and we can thus store the distance (or a 0-bit) to each of these nodes using 
log^ D) bits. We thus only need to bound the size of storing id’s and distances to the nodes 
of uc{u) whose distance is in [Zl, 2D]. Since we only store this for healthy nodes this set has size 
at most n/D and can be described using at most 


O 





bits. Since each distance can be stored using 0(logD) bits we conclude that the total label size 
is bounded by O (log^ D]. 

Theorem 5. There exists a [D,2D]-preserving distance labeling scheme for the family On of 
unweighted graphs on n nodes with maximum label size 



Proof. This is a direct corollary of the discussion above. 


□ 


3.3 Bootstrapping the scheme 

In order to show Theorem |3] we will concatenate several instances of the label from Theorem |5l 
First define £d{u) to be the [I?, 2T)]-preserving distance label for the node u assigned by the 
scheme of Theorem Now assign the following label to each node u: 

£{u) = £d{u) o £2d{u) o £iD{u) o ... o > (3) 
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where k = \ig{n/D)\. Let be the distance returned by running the decoder of Theorem]^ 
on the corresponding component, of the label ^{u). Then we let the decoder of the full 

labeling scheme return 

d = mm{dD,d2D, ■ ■ ■ ,d2kj^) , ( 4 ) 

with k defined as above. We are now ready to prove Theorem 

Proof of Theorem^ Consider any pair of nodes u, v in some graph G e On and let d = distciu, v). 
Also, let d be the value returned by the decoder for i{u) and i{v). If d ^ D we have d^ d. Now 
assume that d e [2* • D, 2*“*'^ • D] for some non-negative integer i. Then, by Theorem]^ and (??) 
we have d = d. 

The size of the label assigned by (??) is bounded by 


teW-D)! 00 

2 


i=0 


i=0 


-log2(Z))2 


» .2 ^ 1 


i=l 




log^(T))) 


□ 


3.4 Lower bound 


D+l 


and let L and R be sets of k nodes which make up the left 


Proof of Theorem Let k = 
and right side of a bipartite graph respectively. Furthermore, let each node of R be the first node 
on a path of D nodes. 

Consider now the family of all such bipartite graphs (L, R) with the attached paths. There 
are exactly 2^ such graphs. 

Now observe, that a node u e L is adjacent to a node v e R ii and only if dist{u, w) = D, 
where w is the last node on the path starting in v. By querying all such pairs (u, w) we obtain 
k"^ bits of information using only 2k labels, thus at least one label of size 


2k 


n 

D+l 



is needed. Since the graph has ^ n nodes this implies the result. 


□ 


This is illustrated in Figure 


4 Sparse and bounded degree graphs 


We are now ready to prove Theorem In fact we will show the following more general lemma: 

Lemma 4. Let PLn,m denote the family of unweighted graphs on n nodes with at most m edges. 
Then there exists a distance labeling scheme for PLn,m with maximum label size 


O (^ • log^ , where D 


logn 

1 + log ^ 
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Figure 1: 


Illustration of the graph family used in the proof of Section 


3.4 


Since m+n = ‘^(1) when m = it will suffice to prove Lemma In order to do so 

we first show the following lemma for bounded-degree graphs: 

Lemma 5. Let Bn{L^) he the family of graphs on n nodes with maximum degree A. There exists 
a distance labeling scheme for Bn{LiT) with maximum label size 



where D 


logn 
1 -I- log A 


log n 


and 


Proof. Suppose we are labeling some graph G e Bn{^) and let u e G. Let D = i_|_ 2 iog a 
let (.d{u) be the D-distance preserving label assigned by using Theorem with parameter D. 
Using this label we can deduce the distance to all nodes of distance at least D to u. 

Since G 6 Bn{/S.) there are at most A^ = 0{\/n) nodes closer than distance D to u. Thus, 
we may describe the IDs and distances of these nodes using at most 0{^/nlogn) bits. This gives 
the desired total label size of 


\£{u)\ = o{yn\og 


n 


n 


D 


— log D] = O — log D 


n 


D 


□ 


Using this result we may now prove Lemma by reducing to the bounded degree case in 
Lemma This has been done before e.g. in distance oracles Enm. 


Proof of Lemma Let G e PLn,m be some graph and let k = max {, 3}. Let u€ G he some 
node with more than k incident edges. If no such node exists, we may apply Lemma directly 
and we are done. Otherwise we split u into \deg{u)/{k — 2)] nodes and connect these nodes with 
a path of 0-weight edges. Denote these nodes u^,..., ttD®g(0/(fc-2)l^ Pqj. edge {u, v) in G we 
assign the end-point at u to a node u* with deg(tt*) < k. This process is illustrated in Figure]^ 
Let the graph resulting from performing this process for every node u e G he denoted by 
G'. We then have A(G') ^ k. Furthermore it holds that for every pair of nodes u,v e G we 
have distG{u,v) = distdu^,v^). Consider now using the labeling scheme of Lemma on G' 
and setting i{u) = l{u^) for each node u e G. We note that splitting nodes in the graph results 
in a weighted graph with weights 0 and 1. However, one can observe that the labeling scheme of 
Theorem actually preserves distances for nodes who have at least D edges on a shortest path 
between them. It thus follows that this is actually a distance labeling scheme for G. The number 
of nodes in G' is bounded by 


L 

ueG 


deg(u) 

k-2 


ueG 


f deg(u) 
\k-2 



2m 

k-2 


+ n = 0{n) 
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which means that Lemma gives the desired label size. 


□ 




>k 



<k-2 <k-2 <k-2 



Figure 2: Illustration of the transformation from sparse graph to bounded degree graph. 


5 Additive error 

We will now show how we can use our Z?-preserving labeling scheme of Theorem to generalize 
the 2-additive distance labeling scheme of Gawrychowski et al. |26) . We will assume that r ^ 
for simplicity. 

Let t = rlog^^n and let D = We describe the scheme in three parts: 

1. Let be a copy of G, where an edge is added between any pair of nodes whose distance 
is at most r/2 in G. Let be the set of nodes in G'' with degree at least t and let S' be a 
minimum dominating set of in Gr- Then IS"! = 

For all nodes u e G we store dist{u, v) and id{v) for all v e S. 

2. Consider now the subgraph of G induced by V\V^^. For a node u ^ let Bu{D) be the 

ball of radius D around u in this induced subgraph Then \Bu{D)\ ^ = 0{-\/n). This 

follows from the definition of There are at most t nodes within distance r/2 from u 
and thus at most nodes within distance r from u, etc. 

For all u ^ we store dist{u,v) and id{v) for all v 6 Bu{D). 

3. Finally we store a D-preserving distance label for all u e G. 

The total label size is then 

O L . (log(rlogn))2^ = O f —^ • polylog(logn • r)) , 

\ t r log n / \ r log n / 

as stated in Theorem |2] 

Decoding To see that the distance between two nodes u and v can be calculated within an 
additive error r we split into several cases: 

• If dist{u, v) ^ D we can report the exact distance between u and v using the D-preserving 
distance scheme. 

• If dist{u, v) ^ D and degc^iv) ^ t we can find a node z e S such that dist{z, v) ^ r/2 and 
thus 

dist{u, z) + dist{z, v) ^ dist{u, v) + dist{v, z) + dist{z, v) ^ dist{u, v) + r , 
and symmetrically if degcriu) ^ t. 

• Finally, if dist{u,v) ^ D and degG^{u) < t and degcriv) < L then we v e Bu{D) and we 
can thus report the exact distance between u and v. 
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